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ITION OF NON-CD4 MEDIATED HIV INFECTION 



TECHNICAL FTELD OF THE INVENTION 

5 

The present invention is directed to a non-CD4 cell surface receptor for gpl20. 
This gpl20 receptor (gpl20r) has been isolated and cloned and is utilized in the present 
invention in methods and kits for the inhibition and detection of HIV infection. 

10 BACKGROUND OF THE INVENTION 

Two types of human retroviruses have been identified, leukemia viruses and AIDS- 
related viruses. The primary targets of the human retroviruses are T lymphocytes and cells 
of the central nervous system. All human "retroviruses are transmitted by intimate contact, 

15 blood contamination, and infection in utero or after birth by milk. It is likely that all 
human retroviruses originated in Africa and that they encountered the human species via 
interspecies infection, possibly from African green monkeys or a related species. The 
human retroviruses first discovered, Human T Lymphotropic Virus Type 1 (HTLV-1) and 
Human T Lymphotropic Virus Type E (HTLV-II), have a preferential tropism for T4 cells 

20 and some T8 cells, share significant sequence homology, and are mainly associated with T 
cell leukemias and lymphomas. The other group of human retroviruses, generally called 
Human Immunodeficiency Viruses (HTV), is discussed in greater detail below. There are 
two major differences between the two types of human retroviruses: (1) there is substantial 
genomic variability among various HTV isolates, whereas the genomes of HTLV-I and 

25 HTLV-n are stable; and (2) HTV entered human populations much more recently man 
HTLV-I or HTLV-n. 

The human immunodeficiency virus (HIV) is a cytopathic retrovirus and the 
causative agent of the acquired immunodeficiency syndrome (AIDS). Two forms of HTV 
have now been identified. The prototype virus, HTV-1, previously termed 

30 lymphadenopathy-associated virus (LAV) and Human T Lymphotropic Virus Type m 
(HTLV-HI), is responsible for the vast majority of reported AIDS cases worldwide. 
Another retrovirus, HTV-2, has been isolated primarily from West African patients with 
AIDS and is pathogenically related to HTV-1. On the genetic level, HTV-2 is actually more 
closely related to the simian immunodeficiency virus (STV), a retrovirus infecting 

35 monkeys. 

Over half of the people that have contracted AIDS in the United States have already 
died. As many as three million persons in this country may be asymptomatic carriers of 
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HIV and ate capable of transmitting the virus. It had been estimated in 1986 that 270,000 
cases of AIDS wfll have occurred in the United States by 1991 (U.S. Public Health 
Service, (1986), Public Health Rep. 101:341). The mortality rate from AIDS is 
disturbingly high, exceeding 80% within three years of diagnosis and possibly reaching 
5 100% over a longer period. 

Worldwide, the AIDS epidemic may involve some five to ten million presently 
infected persons. Particularly troublesome are statistics from the African continent where 
millions of individuals are believed infected with HIV, deaths range in the hundreds of 
thousands, and heterosexual transmission predominates. To date, there is neither a known 

10 cure for AIDS nor an effective vaccine against HTV infection. 

HIV is a member of the nontransforming, cytopathic lentivirus family of 
retroviruses. HIV causes a typically fatal disease characterized by severe 
immunodeficiency or neurodegenerative disease, or both. The primary basis for HIV 
induced immunosuppression is the depletion of the helper/inducer subset of T lymphocytes 

15 expressing the CD4 molecule (14 or CD4 + cells), which serves as a high affinity cell 
surfece receptor for the virus. T4 lymphocytes are involved directly or indirectly in the 
induction of nearly every immunologic function in the body, and their depletion results in 
susceptibility to a wide range of opportunistic infections and neoplasms. 

In addition to the T4 lymphocyte, other cells expressing the CD4 molecule are 

20 targets of HIV infection, especially monocyte-macrophages. HIV infection also results in 
serious B cell abnormalities including polyclonal activation, hyperganMnaglcfculinemia, 
elevated levels of circulating immune complexes, and autoantibodies. A decreased number 
of functional natural killer (NK) cells have also been observed in AIDS patients. 

Infection of CD4 + cells is initiated by the interaction of the CD4 molecule with the 

25 major HTV" envelope glycoprotein gpl20, an event which is followed by internalization and 
uncoanng of the virion, transcription of genomic RNA to DNA by virus-encoded reverse 
transcriptase, and integration of the resulting proviral DNA into host cell chromosomal 
DNA. Also, unintegrated proviral DNA accumulates in large amounts within infected cells 
and is probably a significant factor in HIV cytopathology (Shaw et at, (1984) Science 

30 226:1165). 

The depletion of CD4 + T cells appears to contribute significantly to the 
immunosuppression associated with AIDS. A primary cytopathic effect of the virus in 
vitro is fflV-induced syncytium formation. CD4, through its mteraction with gpl20 plays 
an important role in syncytium formation. However, it has been observed that molecules 
35 on the cell surface of uninfected cells other than CD4 are also involved in HIV-induced 
cell fusion (HUdreth et al. (1989) Science 2M: 1075-1078) . 
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Infection by HIV produces, in addition to AIDS, a set of neuropsychiatric disorders 
which are called the AIDS dementia complex (ADC) (Price et al., (1988)^22:586-592). 

The symptoms of ADC include cognitive impairment, apathy and motor 
dysfunctions, and may affect as many as 90% of AIDS victims. The underlying cause of 
5 ADC appears to be the death of brain cells and fflV-1 can be isolated from the brains of 
infected individuals (Ho et al, (1987) N. Eng. J. Med. 212:278-286). 

An early study suggested that the cellular attachment site for HTV in brain might be 
CD4 (Pert et al., (1986) Proc. Natl. Acad. Sci. USA 51:9254-9258) but attempts to 
replicate these findings were not successful (Kozlowski et al., (1989) Neurosci. Abstr. 
10 15:671). It now appears unlikely that the CD4 antigen is involved in the infection of 
brain-derived cells by HIV. Susceptibility of brain cells to infection with fflV-1 does not 
correlate with the level of expression of CD4 (Chang-Mayer et al., (1987) Proc. Natl. 
Acad. Sci. USA 84:3526-3530; Srinivasan et al., (1988) Arch. Virol. 28:135-141), and 
infection of brain-derived cells by HIV-1 is not blocked by anti-CD4 antibodies (Clapham 
15 et al., (1989) Nature 222:368-370; Li et al., (1990) J. Virol. 64:1383-1387). 

The present invention demonstrates the presence of a non-CD4 receptor for gpl20 
and a method for the inhibition of HIV infection of cells such as brain and muscle which 
do not express high levels of CD4. 



20 STTMMARY OF THE 



Many cells that are susceptible to HTV infection appear to bind gpl20 through a 
non-CD4 surface protein. The present invention has identified this non-CD4 gpl20 
receptor (gpl20r) and has recombinantly expressed and characterized gpl20r. 

25 In this invention a specific non-CD4 gpl20r has been isolated which has specific 

binding activity for gpl20 present on Human Immunodeficiency Virus-1 (HIV). This 
gpl20r has a molecular weight of about 45, 000 daltons, contains about 400 amino acid 
residues and is characterized by a Kd for gpl20 of about 1.3 nM to about 2.0 nM. The 
binding of gpl20 to gpl20r is inhibited by specific carbohydrates, such as mannose and 

30 fucose, plant lectins such as concanavalin A and specific antibiotics, such as pradimicin A. 

In one embodiment of the present invention, a cDNA molecule that transcribes an 
mRNA encoding for gpl20r is cloned and expressed to produce gpl20r. The DNA is 
selected from a gene library obtained from tissue such as placenta, brain, muscle and 
colon. 

35 A method of inhibiting HTV infection of mammalian cells, such as brain, muscle 

and neural cells, is contemplated by the present invention. In this method, cells are 
contacted with an effective amount of an appropriate inhibitor of gpl20r binding for a time 
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period sufficient to significantly inhibit the binding of HIV to the non-CD4 protein, 
gpl20r. Specific inhibitors of gpl20r binding include mannose carbohydrates, fucose 
carbohydrates, plant lectins, and antibiotics such as pradinucin A. 

The gpl20r of the present invention can also be utilized in a method and a kit for 
the detection of the presence of HIV in a fluid sample. In this method, the binding of HTV 
to gp 120r is detected by an indicating means such as a labelled antibody capable of binding 
to the HIV-gpl20r reaction product. It is also contemplated that the gpl20r can be affixed 
to a solid matrix to form a solid support that is useful in this method and/or kit. 

pffl? r pTPTTON O F TRF. FIGURES 

In the drawings: 

FIGURE 1 illustrates expression cloning of the gp!20r cDNA and comparison to 

CD4 

A- Autoradiography of gpl20 binding to gpl20r and CD4 expressed in COS 
cells. A-F f^vgpm A, gpl20r; B. gpl2Qr with G17-2; C, gpl20r with 
200 nM unlabelled bgpl20; D, CD4; E, CD4 with G17-2; F, CD4 with 
bgpl20. G-L [^ngpnO; G, gpl02r, H, gpl20r with 110.1; I, gpl20r 
with bgpl20; J, CD4: k, CD4 with 110.1; L, CD4 with bgpl20. 

B: Inhibitioii of [^rjvgpHO binding to gpl20r and CD4. A-F gpl20r and G- 
L CD4. A+G, HTV antisera (1:20; Trimar); B+H, D-galactose (100 mM); 
C+I, D-mannose (100 mM); D+J, L-tucose (100 mM); E+K, 
Concanavalin A (1 mg/ml); F+L, pradimicin A (100 fig/xaL). 

C: gpl20r binding of HIV. A, HTV; B, HIV with 200 nM bgpl20. 

FIGURE 2 illustrates the characterization of the gpl20r. 

A: Scatchard analysis of [ 125 qgpl20 binding. A - A, vgpl20 binding to 
placenta, Kd 1.3 nM, B^ 19 fmol/mg protein; ■ with pg/ml G17-2; • - 
•, vgpl20 binding to gpl20r COS cells, Kd 1.7 nM, B^ 150,000 
receptors/cell (R/C); O, ngpl20, Kd 1.8 nM, 149,000 R/C. 

B: Inhibition of [^gpl20 binding to gpl20r COS cells. Open symbols 
ngpl20, fffled symbols vgpl20. The relative values were the same with 
both forms of gpl20. Mannan expressed as mg/ml. □, mannan (IC50 6 
Mg/ml); •, L-fucose (K £ 6 mM); A, ce-methyl D-mannoside (K 15 mM), 
O, D-mannose QL 23 mM); O, N-acetylglucosaniine (K. 70 mM), 
EGTACKjO.3 mM). 
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C: Internalization of gpl20by gpl20r COS cells. Points represent the mean of 
two experiments with vgpl20 and ngpl20. • - suface; O - O internal. 

D: Placenta control sera; 2, placenta HTV sera; 3, gpl20r COS control sera; 4, 
gpl20r COS HIV sera. 

5 E: Northern blot of gpl20r expression. Polyadenylated (A + ); 2, placenta; 3, 

thymus; 4+12, forebrain; 5, skeletal muscle; 6, heart; 7, liver; 8, kidney; 
9, colon; 10 medulla; 11, cerebellum; 13, T cell (CEM; 16 fig A+) 14, B 
cell (TS-1; 16 pg A+); 15, macrophage (U937; 8 fig A+); 16, cervical 
carcinoma (HeLa; 16 ng A + ). The different apparent size of the ~5 kb 

10 band is an artifact of displacement by 28S rRNA. 



FIGURE 3 illustrates the sequence analysis of the gp!20r. 
A: Nucleotide and deduced protein sequence of gpl20r cDNA. 
B: Hydropathicity plot of the gpl20r. The predicted transmembrane segment 
15 and the start of the eight amphipathic repeats are indicated by arrows. 

C: Aminoacid alignment of the gpl20r C-type lectin domain. 

DESCRIPTION OF PREFERRED EMBODIMENTS 



20 HIV infection of brain and muscle cell lines is not blocked by soluble CD4 or anti- 

CD4 antibodies (Clapham, P.R. et al., (1989) Nature 222:368-370; Harouse, J.M. et al., 
(1989) J. Virol. £2:2527-2533; Weber, J. et al., (1989) J. Gen. Virol. 20:2653-2660). 
This is consistent with the existence of a second gpl20 receptor. Binding studies indicated 
that human placenta was another source for a non-CD4 gjpl20 receptor, and a cDNA for a 

25 second gpl20 receptor (gp!20r) was isolated by the present invention from a placental 
library. The gpl20r has a higher binding affinity for gpl20 than CD4. Sequence analysis 
revealed homology to membrane associated C-type lectins, and inhibition studies have 
shown that the receptor binds gpl20 through a mannose or fucose containing carbohydrate. 
The gpl20r rapidly internalizes gpl20, and is expressed in placenta, thymus, muscle, and 

30 colon. These results, when considered with previous studies on the role of gpl20 
carbohydrate in HIV infection (Lifson, J. et al., (1986) J. Exp. Med. 1£4:2101-2106; 
Ezekowitz, R.A.B. et al., (1989) J. Exp. Med. 162:185-196; Larkin M. et al., (1989) 
AIDS 2: 793-798; Tanabe-Tochikura A. et al., (1990) Virology J7&473-476), suggest a 
potential role for the gpl20r in HIV infection or pathology. 

35 The present invention demonstrates that the gpl20r participates in cellular binding 

of HTV by a non-CD4 pathway in muscle and brain, as well as, facilitating virus 
attachment in CD4 positive cell types. It is likely that the gpl20r plays a significant role in 
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transplacental transport of HIV (Zacher, V. et aL, (1991) J. Virol. fi£:2102-2107) and 
colon infection. (Bamett, S.W. et al. (1991) ViroL 122:802-809). Gpl20 produces an 
increase in intracellular calcium in rat retinal ganglion cells (Drcyer, E.B. et al., (1990) 
Science 248:364-367) suggesting that the gpl20r or a homologous protein may have 
5 signaling functions in the nervous system disrupted by gpl20 leading to HIV neurotoxicity. 

In the present invention, a new non-CD4 binding protein, or receptor, for gpl20 
was isolated. The HIV surface protein gpl20 was found to bind to a receptor on human 
placental membranes that was not blocked by antibodies directed against CD4, such as 
G17-2 and OKT4a, and which interfere with gpl20 binding to CD4. A cDNA encoding 

io this receptor was isolated from a placental cDNA library in a m a mm a li a n expression vector 
(pCDM8). The gene products were expressed in COS cells and were screened by I- 
labelled gpl20 binding. From a pool of 90,000 cDNA molecules, a single clone was 
isolated that encoded a protein which bound gpl20, even in the presence of concentrations 
of anti-CD4 antibody (G17-2) which completely blocked gpl20 binding to CD4. 

15 Sequence studies were carried out and indicated that the 1.5 kilobase cDNA clone 

encoded a previously unknown member of a family of Type H membrane proteins with an 
extracellular C type lectin domain. 

The cloned gpl20r of the present invention binds gpl20 with an affinity (Kd) of 
about 1 to 2nM, which is considerably greater than the affinity of CD4 for gpl20 (about 

20 Kd = 4 nM). 

The binding of gpl20 to gpl20r is not blocked by polyclonal HTV antisera, but is 
inhibited by mannose carbohydrates, fucose carbohydrates, plant lectins such as 
concanavalin A and pradimicin A antibiotics. Other sugars such as N-acetyl-d-glucosamine 
and galactose are less potent inhibitors. 

25 The gp!20r is expressed on many mammalian cells which do not exhibit high levels 

of CD4, such as placenta, skeletal muscle, brain, and mucosal cells. Other tissue and cells 
displaying gpl20r include colon, thymus, heart, T cells, B cells and macrophages. The 
distribution of tissue having gpl20r parallels that for binding of gpl20 which is not 
blocked by CD4 antibodies, and for HTV infection which is not neutralized by soluble 

30 CD4. This observation suggests a role for gp!20r in viral infection. 

In gpl20r expressing transfected COS cells, gpl20 is rapidly internalized following 
binding to gpl20r. This binding and mternalization of gpl20 is inhibited by compounds 
such as mannan, concanavalin A and pradimicin A. 

In die present invention a cDNA which encodes gpl20 was isolated and cloned. A 

35 DNA molecule of the present invention corresponds to a complementary DNA molecule 
which transcribes a messenger RNA (mRNA) molecule which, when translated, encodes 
gpl20r. The cDNA molecules were obtained by reverse-transcribing mRNA molecules 
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isolated from mammalian tissue such as placenta, colon, brain or thymus. The 
transcription and cloning of cDNA molecules and isolation of gene products are techniques 
well known in the art and, for example, are described in Sambrook et al., "Molecular 
Cloning: A Laboratory Manual " . 2d edition, Cold Spring Harbor Lab., Cold Spring 
5 Harbor, NY (1989), which is incorporated herein by reference. 

As used herein, the phrases "physiologically tolerable" and "pharmaceutical^ 
acceptable" refer to molecular entities and compositions that do not produce an allergic or 
similar untoward reaction, such as gastric upset, dizziness and the lite, when administered 
to a mammal. The physiologically tolerable carrier may take a wide variety of forms 
10 depending upon the preparation desired for administration and the intended route of 
administration. 

A carrier is a material useful for administering the active compound and must be 
"acceptable" in the sense of being compatible with the other ingredients of the composition 
and not deleterious to the recipient thereof. 
15 The pharmaceutical compositions are prepared by any of the methods well known in 

the art of pharmacy all of which involve bringing into association the active compound and 
the carrier therefor. 

For therapeutic use, the agent utilized in the present invention can be administered 
in the form of conventional pharmaceutical compositions. Such compositions can be 
20 formulated so as to be suitable for oral or parenteral administration, or as suppositories. In 
these compositions, the agent is typically dissolved or dispersed in a physiologically 
tolerable carrier. 

As an example, the compounds of the present invention can be utilized in liquid 
compositions such as sterile suspensions or solutions, or as isotonic preparations containing 

25 suitable preservatives. Particularly well suited for the present purposes are injectable 
media constituted by aqueous injectable isotonic and sterile saline or glucose solutions. 
Additional liquid forms in which the present compounds may be incorporated for 
administration include flavored emulsions with edible oils such as cottonseed oil, sesame 
oil, coconut oil, peanut oil, and the like, as well as elixirs and similar pharmaceutical 

30 vehicles. 

The present agents can also be administered in the form of liposomes. As is known 
in the art, liposomes are generally derived from phospholipids or other lipid substances. 
Liposomes are formed by mono* or multi-lamellar hydrated liquid crystals that are 
dispersed in an aqueous medium. Any non-toxic, physiologically acceptable and 
35 metabolizable lipid capable of forming liposomes can be used. The present compositions 
in liposome form can contain, in addition to the agent of the present invention, stabilizers, 
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preservatives, expedients, and the like. The preferred lipids are the phospholipids and the 
phosphatidyl cholines (lecithins), both natural and synthetic. 

Methods to form liposomes are known in the art. See, for example, Prescott, Ed., 
- M p T hnH« in Pril Biology ". Volume XIV, Academic Press, New York, N.Y. (1976) p 33 
etseq. 

The present compounds can also be used in compositions such as tablets or piHs, 
preferably containing a unit dose of the compound. To this end, the agent (active 
ingredient) is mixed with conventional tabletting ingredients such as com starch, lactose, 
sucrose, sorbitol, talc, stearic acid, magnesium stearate, dicalcium phosphate, gums or 
similar materials as non-toxic, physiologically tolerable carriers. The tablets or pills of the 
present compositions can be laminated or otherwise compounded to provide unit dosage 
forms affording prolonged or delayed action. 

It should be understood that in addition to the aforementioned carrier ingredients the 
pharmaceutical formulation described herein can include, as appropriate, one or more 
15 additional carrier ingredients such as diluents, buffers, flavoring agents, binders, surface 
active agents, thickeners, lubricants, preservatives (including antioxidants) and the like, 
and substances included for the purpose of rendering the formulation isotonic with the 
blood of the intended recipient. 

The tablets or pills can also be provided with an enteric layer in the form of an 
20 envelope that serves to resist disintegration in the stomach and permits the active 
ingredient to pass intact into the duodenum or to be delayed in release. A variety of 
materials can be used for such enteric layers or coatings, including polymeric adds or 
mixtures of such acids with such materials as shellac, shellac and cetyl alcohol, cellulose 
acetate, and the like. A particularly suitable enteric coating comprises a styrene-maleic 
25 acid copolymer together with known materials that contribute to the enteric properties of 
the coating. 

A method of inhibiting HIV infection of mammalian cells is disclosed in the present 
invention. A pharmaceutical composition containing a compound which effectively inhibits 
the binding of gpl20r to HTV, is contacted with cells either in vitro or in vivo for a time 
30 period sufficient to agnificantly inhibit the binding of HIV to the cell surface. 

Compounds effective in this method include mannose carbohydrates, fucose 
carbohydrates, plant lectins and pradiniicin A antibiotics. Specifically preferred 
compounds are mannose, fucose, mannan, concanavalin A and pradimirin A. The 
pharmaceutical composition of the present invention includes a compound which effectively 
35 inhibits gpl2Qr binding to HTV and may also include a physiologically tolerable carrier. 

The method of the present invention is preferably utilized to inhibit HIV infection 
of placental, brain, muscle, neural and colon cells. 
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A diagnostic method is also described in the present invention for detecting the 
presence, and preferably the amount, of HTV present in a fluid sample by producing a 
reaction product containing HIV bound to gpl20r. Those skilled in the art will recognize 
that there are well known clinical diagnostic procedures that can be utilized for the 
5 formulation and detection of such reaction products. Thus, while exemplary assay methods 
are described herein, the invention is not intended to be so limited. 

Various heterogeneous and homogeneous assay protocols can be employed for 
detecting the presence, and preferably the amount, of HIV in a fluid sample. For example, 
the present invention contemplates a method for assaying a sample, such as a body fluid, 
10 for the presence of HIV comprising the steps of: 

(a) admixing a fluid sample with gpl20r, either in solution or affixed to a solid 
matrix; 

(b) maintaining the admixture for a predetermined time period such as about 10 
minutes to about 16 - 20 hours and under biological assay conditions at a 

15 temperature of about 4°C to about 45°C that is sufficient for any HTV 

present in the sample to react with (bind) the gpl20r to form a reaction 
product; and 

(c) determining the presence of any reaction product that is formed, and thereby 
the presence of any HIV in the admixture. 

20 Preferably, the fluid sample is a body fluid sample, such as blood, plasma, serum, 

urine, saliva, semen or cerebrospinal fluid (CSF). 

The determination of the presence of a reaction product, either directly or 
indirectly, can be accomplished by assay techniques well known in the art such as by the 
use of an indicating or labelling means, as discussed hereinbelow. In a preferred 

25 embodiment, a labelled indicating means, such as a fluorescein-labelled antibody, is 
capable of binding to the gpl20r present in the reaction product to form a labelled 
complex. Determining the presence of the labelled complex provides an assay for the 
presence of HIV in the sample. In particularly preferred embodiments, the amount of 
labelled indicating means bound as part of the complex is determined, and thereby the 

30 amount of HIV present in the sample is determined. When that amount is zero, no HIV is 
present in the sample, within the limits of detection. Methods for assaying the presence 
and amount of a labelled indicating means depend on the label used, such labels and assay 
methods being well known in the art. 

In a preferred embodiment, the gpl20r is affixed on a solid matrix to form a solid 

35 phase support. In that embodiment, the assay is heterogeneous, solid/liquid phase assay 
and, as such, has its own preferred manipulations. For example, following admixing of a 
liquid sample with a solid support containing gp!20r affixed thereto, the admixture is 
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also a detection means. In one embodiment, the package can contain a microtiter plate 
well to which microgram quantities of gpl20r have been operatively affixed, ie., linked so 
as to be capable of reacting with and bind HIV and/or gpl20. 

As used herein, the terms "label" "indicating means 0 and "labelled indicating 
5 means", in their various grammatical forms refer to single atoms and molecules that are 
either directly or indirectly involved in the production of a detectable signal to indicate or 
detect the presence of a reaction product Such labels are themselves well known in 
clinical diagnostic chemistry and constitute a part of this invention only insofar as they are 
utilized with otherwise novel methods and/or systems. 

10 The indicating means can be a fluorescent labelling agent that chemically binds to 

antibodies or protein antigens without denaturing them to form a fluorochrome (dye) that is 
a useful immunofluorescent tracer. Suitable fluorescent labelling agents are fluorochrome, 
such as fluorescein isocyanate (FIC), fluorescein isothiocyanate (FTTC), 5-dimethylamine- 
1-naphthalene sulfonyl chloride (DANSC), tetramethylrhodamine isocyanate (TRTTC), 

15 lissamine and the like. Immunofluorescence analysis techniques are well known in the art, 
and for example, is described in DeLuca, "Immunofluorescence Analysis" in 
Immunofluorescence Analysis . Marchalonis et al., (1982) eds., John Wiley & Sons, Ltd., 
pp. 189-231, which is incorporated herein by reference. 

Other preferred indicating means are colorimetric agents and enzymes, such as 

20 horseradish peroxidase, glucose oxidase or the like, linked as described above, as well as 
radioactive elements, preferably an element that produces gamma ray emissions. Elements 
which emit gamma rays, , such as 124 I, 125 I, 128 I, 132 I, and 51 Cr represent one class of 
radioactive indicating groups. Another group of useful labelling means are those elements 
such as ^C, 18 F, 15 0 and 13 N which emit positrons. The positrons so emitted produce 

25 gamma rays upon interaction with electrons present 

Having generally described this invention, a further understanding can be obtained 
by reference to certain specific examples which are provided herein for purposes of 
illustration only and are not intended to be limiting unless otherwise specified. 

30 EXAMPLE 1 

Cloning and Isolation of Non-CD4 Gd140 Receptor Protein 

Human placental membranes were found to be able to bind vaccinia derived 
recombinant gpl20 (vgpl20) with a Kd of 1.3 nM. At nM (concentrations) of gpl20 none 
35 of this binding was inhibited by an antibody (G17-2) which has been reported to efficiently 
block gpl20 binding to CD4 (Iinsley et al. (1988) J. Virol. &:3695-3702), as shown in 
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FIGURE 2A. Approximately 50 - 90% of the total placental gpl20 binding was not due to 
CD4. 

A placental cDNA library was obtained in the mamma l i a n expression vector 
pCDM8 and was screened. A cDNA was isolated which expressed proton that exhibited 
high affinity binding for vgpl20 in the presence of G17-2. 

This protein, designated as gpl20 receptor (gpl20r), also bound native gpl20 
(ngpl20), and the binding component was precipitated in the presence of an antibody 
directed against gpl20. 

EXAMPLE 2 



The binding of relabelled gpl20 to gpl20r expressed in COS-7 cells was 
studied. Pools of 90,000 cDNA molecules, obtained from a placental pCDM8 library, 

15 were transfected by electroporation into COS-7 cells. Cells whfch expressed gpl20r oirjhe 
surface was identified by screening with either 1 nM of 125 I-labelled vgpl20 ( I- 
vgpl20) or I25 I-ngpl20 by the method described in Kozlowski et al., (1990) Antivir. 
Chem. Chemother. 1:175-182, incorporated herein by reference. The results of binding 
studies utilizing the transfected COS-7 cells are shown in FIGURE 1. 

20 Binding of labelled gpl20 (1 nM) to the cells was carried out following a 1 hour 

preincubarion of the cells or GP120 at 22°C with one or more of the following: anti-CD4 
antibody G17-2 (5 ug/ml), baculovirus-derived gpl20 (bgpl20, American Biotechnologies, 
200 nM), anti-gpl20 monoclonal antibody 110.1 (25 /ig/ml), D-mannose (100 mM), D- 
galactose (100 mM), L-fucose (100 mM), concanavalin A (1 mg/ml) or pradiniicin A (100 

25 ug/ml). The cells were monitored after autoradiography (3 days). The results seen in 
FIGURES 1 (A and B) illustrate that gpl20 binding to the gpl20r expressed on the cells 
was blocked by excess bgpl20, mannose, fucose, pradiniicin A, Concanavalin A, and 
preincubation with antibody 110.1 but not by CD4, antibody G17-2, galactose, or HIV 
antisera. Studies were also carried out on gpl20 binding to CD4 expressing COS cells, 

30 transfected with « B3MCD4 by the method of Peterson et aL (1988) Cell 54:65-72. 

Control studies of the binding of 125 I-labeIled psoralen-UV inactivated HIV-BRU 
to the gpl20r expressing COS-7 cells demonstrated binding of HIV to gpl20r and blockage 
by excess bgpl20 (FIGURE 1Q. A tabular compilation quaniitating the amount of bound 
material to the cells in FIGURE 1 is shown in Table. 1 
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TABLE 1 



FIGURE! 
A 
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1 jMrrl 1 BO 




CPM BOUND X 10 




vgpl20+ 


coMPErmoNwrm 




CD4 


— 


60 


20 




G17-2 


60 


5 




bgpl20 


6 


4 


vgpl20+ 


HIVuitisen 


63 


3 




D-gtlictofc 


50 


20 




D-nuumotc 


6 


20 




L-fucose 


6 


20 




coaeanavaUn A 


8 


6 




pyxfiniicin A 


8 


6 




OKF4A 


60 


5 




N-tcetylgilactoiamine 


60 


20 




N-tcctylglucowminc 


30 


20 




minntn 


6 


20 




rrunno*e-6-pho«phite 


60 


20 




uaBcacid 


60 


20 




human IgE 


60 


20 


HTV-BRU+ 




8 


4 




bgpl20 


2 


2 



30 



35 



40 



Scatchard plots of gpl20 binding to placental membranes and to COS cells 
expressing the gpl20r were carried out in the presence and absence of a 200 fold excess of 
bgpl20 or ngpl20. The results, shown in FIGURE 2A, demonstrate a specific binding of 
vgpl20 to gpl20r with a Kd of 1.7 nM ± 0.4 (n=4) and of ngpl20 to gpl20r with Kd of 
1.8 nM ± 0.2 (n=4), with 150,000 and 149,000 receptors per cell, respectively. 
Concurrent analysis of gpl20 binding to CD4 expressed on COS cells gave a Kd of 4-5 nM 
in agreement with previous reports (Linsley, P.S. et al. (1988) J. Virol £2:3695-3702; 
Schnittman, et al. (1988) J. Immunol. 141:4181-4186). Calculations from the association 
and dissociation rate constants gave a similar comparative result. The expressed gpl20r 
has a relative molecular mass (Mr) of "48,500 and a protein of similar size was also 
partially purified from placental membranes (FIGURE 2D). 

The placental membranes and COS cells were surface iodinated, and treated with 1 
nM unlabelled vgpl20, then washed with Blotto RPMI, 5% BSA, 1% Non-fat dry milk, 
0.2% sodium azide solubilized in Triton X-100 (1% in PBS with a protein inhibitor 
cocktail, PMSF, Pepstatin A, orthophenathroline and leupeptin) and immunoprecipitated 
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with HIV or control human sera, according to the method described in Curtis et al. (1990) 

J. Immunol. 144:1295-1303. 

Northern analysis of the expression of the gp 120r RNA indicated a major species of 
-5 kb and a minor species of "1.7 kb which may represent an alternatively processed 
transcript and is more consistent with the size of the gpl20r cDNA. RNA was denatured, 
separated in an agarose gel, transferred to nitrocdlulose, hybridized to gpl20r cDNA and 
autoradiographed for 3 days. 

Expression of gpl20r RNA was highest in colon followed by thymus, placenta, 
heart, skeletal muscle, and was not detected in liver or kidney. Low levels of expression 
in brain, T cell, B cell, and macrophage (FIGURE 2E) require verification by polymerase 
chain reaction (PCS). Full length CD4 RNA was highest in thymus, T cell, and 
macrophage followed by placenta and colon (not shown). 

The gpl20r cDNA encodes a protein of 404 amino acids with a calculated Mr of 

45,775 (FIGURE 3A). 

15 Sequencing of both strands of gpl20r cDNA was carried out by the dideoxy chain 

termination method. The nucleotide sequence preceeding the first ATG agrees with the 
Kozak consensus. The predicted cytoplasmic domain has a similar length and shows some 
sequence homology to other type U membrane protein C-type lectins (Spiess, M. (1990) 
Biochemistry 22:10009-10018). The membrane spanning sequence is underlined and was 

20 predicted in part by homology to related sequences in FIGURE 3C. The potential N- 
linked glycosylation site is marked by an asterisk. The start of the seven complete and 
eighth partial tandem repeats are indicated (R1-R8). The consensus repeat sequence is 
IYQELT(R/Q) LKAAVGELPEKSKLQE. The beginning of the lectin domains is also 
indicated (L). No signal sequence was apparent but instead demonstrated homology to a 

25 family of Type H membrane proteins which utilize a '20 residue hydrophobic stop-transfer 
sequence for membrane translocation. The "positive inside rule" (von Heijne, G. et al. 
(1988) Eur. J. Biochem. 171:671-678) for the sequence within fifteen residues of the 
transmembrane region predicts a cytoplasmic amino terminus in agreement with the 
homology to membrane associated C-type lectins with similar membrane orientation 

30 (FIGURE 3Q (Spiess, M. (1990) Biochemistry 25:10009-10018). This region, Met 1 to 
Ala 76, represents the first domain of the gpl20r sequence. 

The second domain (He 77 to Val 249) consists of tandem repeats of nearly 
identical sequence (FIGURE 3A). This region was predicted to consist of a series of 
amphipathic a-helices interrupted by B-tums. Circular Dichroism spectra in 4056 

35 trifluoroethanol of a consensus repeat peptide beginning with the fl-tum, 
PEKSKLQE1YQELTQLKAAVGEL (single-letter amino-acid code), demonstrated an all a- 
heJical structure (not shown). Homology to other repeat domains suggested three possible 
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tertiary structures, (1) antiparallel helix bundles, (2) a multimeric parallel helix bundle, and 
(3) a membrane pore with a hydrophobic exterior and a negatively charged interior. The 
first two models would function as spacers to separate the lectin domain from the 
membrane, while the third could generate a transmembrane signal after ligand binding. 
5 The third domain (Cys 253 to Ala 404) is homologous to the other known C-type 

lectins which are type II membrane proteins (FIGURE 3C). With the exception of the 
IgEr, these lectins bind terminal D-galactose and D-N-acetylgalactosamine of glycoproteins 
(Spiess, M. (1990) Biochemistry 22: 10009-10018). 

The most closely related sequences were the group of Type n membrane protein C- 

10 type lectins: Chick hepatic lectin (CHL) (Drickamer, KJ. (1981) Biol. Chem. 256:5827- 
5839), low affinity IgE receptor (IgEr) (Kikutani, H. et al. (1986) Cell 4Z: 657-665), the 
asialoglycoptorein receptors (human HI and H2 (Spiess, M. et al. (1985) Proc. Natl. 
Acad. Sci. USA 52:6465-6569) are shown), and the rat Kupffer cell receptor (Hoyle, 
G.W. et al. (1988) J. Biol. Chem. 263:7487-7492). The most similar mannose binding 

15 lectin was one of the eight carbohydrate recognition domains of the human macrophage 
mannose receptor (Mannr) (Taylor, M.E. et al. (1990) J. Biol. Chem. 265:12156-12162; 
Ezekowitz, R.A.B. et al. (1990) J. Exp. Med. 122:1785-1794). Residues identical to the 
gpl20r are boxed. ALIGN scores indicate significant sequence similarity if greater than 
3.0. The complete gp!20r sequence was most homologous to the Kupffer cell receptor 

20 which has a similar tandem repeat (Hoyle, G.W. et al. (1988) J. Biol. Chem. 262:7487- 
7492). 

The inability to crosslink gp!20 to the non-CD4 sites on placenta and brain cell 
lines (not shown) was consistent with an interaction of the gpl20r with carbohydrate, and 
polyclonal HIV antisera added to gpl20 blocked binding to CD4 but not to the gpl20r 

25 (FIGURE IB). Galactose and N-acetylgalactosamine did not block gpl20 binding, but 
mannose and fucose completely blocked binding to the gpl20r without an effect on CD4 
(FIGURE IB). Inhibition by a series of sugars is shown in FIGURE 2B. Human IgE (10 
jig/ml), sialic acid (100 mM), and mannose-6-phosphate (100 mM) had no effect on 
binding to the gpl20r. The three forms of gpl20 used have different oligosaccaride 

30 structures. Bgpl20 contains only high mannose structures (Hsieh, P. et al. (1984) J. Biol. 
Chem. 252:2375-2382), Vgpl20 has equal proportions of high mannose and complex 
(Mizuochi, T. et al. (1988) Biochem. J. 214:599-603) similar to ngpl20 which has a 
greater structural diversity in the complex chains (Geyer, H. et al. (1988) J. Biol. Chem. 
262:11760-11767; Mizuochi, T. et al. (1990) J. Biol. Chem. 265:8519-8524). The 

35 affinity of the gpl20r for all three forms was similar (FIGURE 2A ) suggesting that the 
terminal mannose of high mannose chains are the primary determinants of binding. As 
expected for a C-type lectin the gpl20r required calcium and binding was blocked by 
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EGTA (FIGURE 2B). The gpl20r carbohydrate specificity is more closely related to 
serum mannose -binding proteins and to the Mr 175,000 mannose-specific endocytosis 
receptor found in macrophages and placenta (Taylor, M.E. et aL (1990) J. Biol. Chem. 
265-12156-12162; Ezekowitz, R.A.B. et aL (1990) I. Exp. Med. 122:1785-1794) 
5 (FIGURE 3C). Low (1 nM) concentrations of gpl20 did not purify a Mr 175,000 band 
from placental membranes (FIGURE 2D) consistent with a reported concentration of 150- 
300 nM for gpl20 saturation of the macrophage receptor (Larkin, M. et al. (1989) AIDS 

3,793-798). Jfc . 

The importance of gp 120 carbohydrate in HTV infection has been suggested by the 

10 ability of plant lectins (Lifson, J. et al. (1986) E. J. Exp. Med. 1^:2101-2106) and serum 
mannose-binding protein (Ezetowitz, R.A.B. et aL (1989) I. Exp. Med. lf£ 185-196) to 
block infection, and a proposed role for the macrophage endocytosis receptor in viral 
attachment (Larking M et al. (1989) AIDS 3, 793-798). Concanavalin A treatment of 
gpl20 blocked binding to the gpI20r and CD4 (FIGURE IB), consistent with a stenc 

15 hindrance of receptor interaction. The antibiotic pradimicin A blocks BTV infection of 
CD4 positive T cells and tins inhibitory effect is prevented by mannan and EGTA (Tanabe- 
Tochikura. A. et al. (1990) Virology 1Z&476-473). Pradamicin blocked gpl20 binding to 
the gpl20r and CD4, while mannan and EGTA only inhibited binding to the gpl20r 
(FIGURE2B). Mannan inhibited "10% of high affinity (nM) gpl20 binding to T cells and 

20 macrophages, consistent with gpl20r expression (FIGURE 2E), suggesting that in addition 
to CD4 the gpl20r may be important for HTV binding and infection. The observation the 
the gpl20r rapidly internalized its bound ligand gpl20 (FIGURE 2C), and also binds 
ladiolabelled HIV in a gpl20 dependent fashion (FIGURE 1Q also support mis 
conclusion. 

25 The foregoing description and Examples are intended as illustrative of the present 

invention, but not as limiting. Numerous variations and modifications may be effected 
without departing from the true spirit and scope of the present invention. 
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(2) 

C) 



INFORMATION FOR SEQ ID NO: 1: 
SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(Q STRANDEDNESS: 

(D) TOPOLOGY: 



1312 base pairs 
nucleic acid 
double 
linear 



(ii) MOLECULE TYPE: cDNA 
(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Human immunodeficiency virus type 1 

fix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 42.. 1253 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

CTAAAGCAGG AGTTCTGGAC ACTGGGGGAG AGTGGGGTGA C ATG AGT GAC TCC 

1 

AAG GAA CCA AGA CTG CAG CAG CTG GGC CTC CTG GAG GAG GAA CAG CTG 
Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu Glu Glu Gin Leu 
5 10 15 

AGA GGC CTT GGA TTC CGA CAG ACT OGA GGA TAC AAG AGC TTA GCA GGG 
Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lye Ser Leu Ala Gly 
25 30 35 

TGT CTT GGC CAT GGT CCC CTG GTG CTG CAA CTC CTC TCC TTC ACG CTC 
cys Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu Ser Phe Thr Leu 
40 45 50 

TTG GCT GGG CTC CTT GTC CAA GTG TCC AAG GTC CCC AGC TCC ATA AGT 
Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro Ser Ser He Ser 
55 60 65 

CAG GAA CAA TCC AGG CAA GAC GCG ATC TAC CAG AAC CTG ACC CAG CTT 
Gin Glu Gin Ser Arg Gin Asp Ala He Tyr Gin Asn Leu Thr Gin Leu 
70 75 80 

AAA GCT GCA GTG GGT GAG CTC TCA GAG AAA TCC AAG CTG CAG GAG ATC 
Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys Leu Gin Glu lie 
is 90 95 100 



53 



101 



149 



197 



245 



293 



341 
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TAC CAG GAG CTG ACC CAG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG 389 
Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
105 HO 115 

AAA TCT AAG CTG CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT 437 
Lys Ser Lys Leu Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
120 125 130 

GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CTG CAG GAG ATC TAC CAG 485 
Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gin Glu lie Tyr Gin 
135 140 145 

GAG CTG ACC TGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT 533 
Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser 
150 155 160 

AAG ATG CAG GAG ATC TAC CAG GAG CTG ACT CGG CTG AAG GCT GCA GTG 581 
Lys Met Gin Glu lie Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val 
165 170 175 180 



GGT GAG CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG 
Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu lie Tyr Gin Glu Leu 
185 190 195 



629 



ACC CGG CTG AAG GCT GCA GTG GGT GAG CTT CCA GAG AAA TCT AAG CAG 
Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin 
200 205 210 



677 



CAG GAG ATC TAC CAG GAG CTG ACC CGG CTG AAG GCT GCA GTG GGT GAG 
Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu 
215 220 225 



725 



CTT CCA GAG AAA TCT AAG CAG CAG GAG ATC TAC CAG GAG CTG ACC CAG 
Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Gin 
230 235 240 



773 



CTG AAG GCT GCA GTG GAA CGC CTG TGC CAC CCC TGT CCC TGG GAA TGG 
Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys Pro Trp Glu Trp 
245 250 255 260 



821 



ACA TTC TTC CAA GGA AAC TGT TAC TTC ATG TCT AAC TCC CAG CGG AAC 
Thr Phe Phe Gin Gly Asn Cys Tyr Phe Met Ser Asn Ser Gin Arg Asn 
265 270 275 



869 



TGG CAC GAC TCC ATC ACC GCC TGC AAA GAA GTG GGG GCC CAG CTC GTC 
Trp His Asp Ser He Thr Ala Cys Lys Glu Val Gly Ala Gin Leu Val 
280 285 290 



917 



GTA ATC AAA AGT GCT GAG GAG CAG AAC TTC CTA CAG CTG CAG TCT TCC 
Val He Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin Leu Gin Ser Ser 
295 300 305 



965 
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AGA AGT AAC CGC TTC ACC TGG ATG GGA CTT TCA GAT CTA AAT CAG GAA 
Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp Leu Asn Gin Glu 
310 315 320 

GGC ACG TGG CAA TGG GTG GAC GGC TCA CCT CTG TTG CCC AGC TTC AAG 
Gly Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu Pro Ser Phe Lys 
325 330 335 340 

CAG TAT TGG AAC AGA GGA GAG CCC AAC AAC GTT GGG GAG GAA GAC TGC 
Gin Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly Glu Glu Asp Cys 
345 350 355 

GCG GAA TTT AGT GGC AAT GGC TGG AAC GAC GAC AAA TGT AAT CTT GCC 
Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys Cys Asn Leu Ala 
360 365 370 

AAA TTC TGG ATC TGC AAA AAG TCC GCA GCC TCC TGC TCC AGG GAT GAA 
Lys Phe Trp lie Cys Lys Lys Ser Ala Ala Ser Cys Ser Arg Asp Glu 
375 380 385 

GAA CAG TTT CTT TCT CCA GCC CCT GCC ACC CCA AAC CCC CCT CCT GCG 
Glu Gin Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn Pro Pro Pro Ala 
390 395 400 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 



404 amino acids 
amino acid 
linear 

protein 



1013 



1061 



1109 



1157 



1205 



1253 



TAGCAGAACT TCACCCCCTT TTAAGCTACA GTTCCTTCTC TCCATCCTTC GACCTTTAG 1312 

(2) INFORMATION FOR SEQ ID NO: 2: 
(i) SEQUENCE CHARACTERISTICS: 



(xi) SEQUENCE DESCRIPTION: SEQIDNO:2: 



Met Ser Asp Ser Lys Glu Pro Arg Leu Gin Gin Leu Gly Leu Leu Glu 
1 5 10 15 

Glu Glu Gin Leu Arg Gly Leu Gly Phe Arg Gin Thr Arg Gly Tyr Lys 
20 25 30 

Ser Leu Ala Gly Cys Leu Gly His Gly Pro Leu Val Leu Gin Leu Leu 
35 40 45 

Ser Phe Thr Leu Leu Ala Gly Leu Leu Val Gin Val Ser Lys Val Pro 
50 55 60 
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Ser Ser lie Ser Gin Glu Gin Ser Arg Gin Asp Ala He Tyr Gin Asn 
65 70 75 80 

Leu Thr Gin Leu Lys Ala Ala Val Gly Glu Leu Ser Glu Lys Ser Lys 
85 90 95 

Leu Gin Glu He Tyr Gin Glu Leu Thr Gin Leu Lys Ala Ala Val Gly 
100 105 HO 

Glu Leu Pro Glu Lys Ser Lys Leu Gin Glu He Tyr Gin Glu Leu Thr 
115 120 125 

Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Leu Gin 
130 135 140 

Glu He Tyr Gin Glu Leu Thr Trp Leu Lys Ala Ala Val Gly Glu Leu 
145 150 155 160 

Pro Glu Lys Ser Lys Met Gin Glu He Tyr Gin Glu Leu Thr Arg Leu 
165 170 175 

Lys Ala Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He 
180 185 190 

Tyr Gin Glu Leu Thr Arg Leu Lys Ala Ala Val Gly Glu Leu Pro Glu 
195 200 205 

Lys Ser Lys Gin Gin Glu He Tyr Gin Glu Leu Thr Arg Leu Lys Ala 
210 215 220 

Ala Val Gly Glu Leu Pro Glu Lys Ser Lys Gin Gin Glu He Tyr Gin 
225 230 235 240 

Glu Leu Thr Gin Leu Lys Ala Ala Val Glu Arg Leu Cys His Pro Cys 
245 250 255 

Pro Trp Glu Trp Thr Phe Phe Gin Gly Asn Cys Tyr Phe Met Ser Asn 
260 265 270 

Ser Gin Arg Asn Trp His Asp Ser He Thr Ala Cys Lys Glu Val Gly 
275 280 285 

Ala Gin Leu Val Val lie Lys Ser Ala Glu Glu Gin Asn Phe Leu Gin 
290 295 300 

Leu Gin Ser Ser Arg Ser Asn Arg Phe Thr Trp Met Gly Leu Ser Asp 
305 310 315 320 

Leu Asn Gin Glu Gly Thr Trp Gin Trp Val Asp Gly Ser Pro Leu Leu 
325 330 335 

Pro Ser Phe Lys Gin Tyr Trp Asn Arg Gly Glu Pro Asn Asn Val Gly 
340 345 350 
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Glu Glu Asp cys Ala Glu Phe Ser Gly Asn Gly Trp Asn Asp Asp Lys 

360 365 



355 



Cys Asn' Leu Ala Lys Phe Trp He Cys Lys Lys Ser Ala Ala Ser Cys 
1 380 



370 



375 



Ser Arg Asp Glu Glu Gin Phe Leu Ser Pro Ala Pro Ala Thr Pro Asn 



385 

Pro Pro Pro Ala 



390 



395 



(2) INFORMATION FOR SEQ ID NO: 3: 
Ci) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 

(ii) MOLECULE TYPE: 

(v) FRAGMENT TYPE: 

(vi) ORIGINAL SOURCE: 
(A) ORGANISM: 



127 amino acids 
amino acid 
linear 

protein 

internal 



Human immunodeficiency virus type 1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 



Cys His Pro Cys Pro Trp Glu Trp Thr Phe Phe Gin Gly Asn Cys Tyr 



10 



Phe Met Ser Asn Ser Gin Arg Asn Trp His Asp Ser He Thr Ala Cys 
20 25 30 

Lys Glu Val Gly Ala Gin Leu Val Val He Lys Ser Ala Glu Glu Gin 
35 40 4 5 

Asn Phe Leu Gin Leu Gin Ser Ser Arg Ser Asn Arg Phe Thr Trp Met 
50 55 6° 

Gly Leu Ser Asp Leu Asn Gin Glu Gly Thr Trp Gin Trp Val Asp Gly 
65 70 75 80 

Ser Pro Leu Leu Pro Ser Phe Lys Gin Tyr Trp Asn Arg Gly Glu Pro 
85 90 95 
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Asn Asn Val Gly Glu Glu Asp Cys Ala Glu Phe Ser Gly Asn Gly Trp 
100 105 HO 

Asn Asp Asp Lys Cys Asn Leu Ala Lys Phe Trp He Cys Lys Lys 
115 120 t 125 

(2) INFORMATION FOR SEQ ID NO: 4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4: 



Cys Gly Ala Gin Ser Arg Gin Trp Glu Tyr Phe Glu Gly Arg Cys Tyr 
15 10 15 

Tyr Phe Ser Leu Ser Arg Met Ser Trp His Lys Ala Lys Ala Glu Cys 
20 25 30 

Glu Glu Met His Ser His Leu He He He Asp Ser Tyr Ala Lys Gin 
35 40 45 

Asn Phe Val Met Phe Arg Thr Arg Asn Glu Arg Phe Trp He Gly Leu 
50 55 60 

Thr Asp Glu Asn Gin Glu Gly Glu Trp Gin Trp Val Asp Gly Thr Asp 
65 * 70 75 80 

Thr Arg Ser Ser Phe Thr Phe Trp Lys Glu Gly Glu Pro Asn Asn Arg 
85 90 95 

Gly Phe Asn Glu Asp Cys Ala His Val Trp Thr Ser Gly Gin Trp Asn 
100 105 HO 

Asp Val Tyr Cys Thr Tyr Glu Cys Tyr Tyr Val Cys Glu Lys 
115 120 125 



(2) INFORMATION FOR SEQ ID NO: 5: 
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(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 



125 amino acids 
amino acid 
linear 



(ii) MOLECULE TYPE: 



protein 



(v) FRAGMENT TYPE: 



internal 



(xi) SEQUENCE DESCRIPTION: SEQIDNO: 5: 

Cys Asn Thr Cys Pro Glu Lys Trp lie Aan Phe Gin Arg Lys Cys Tyr 
! 5 10 15 

Tyr Phe Gly Lys Gly Thr Lys Gin Trp Val His Ala Arg Tyr Ala Cys 
20 25 30 

Asp Asp Met Glu Gly Gin Leu Val Ser He His Ser Pro Glu Glu Gin 
35 40 45 

Asp Phe Leu Thr Lys His Ala Ser His Thr Gly Ser Trp He Gly Leu 
50 55 60 

Arg Asn Leu Asp Leu Lys Gly Glu Phe He Trp Val Asp Gly Ser His 
65 70 75 80 

Val Asp Tyr Ser Asn Trp Ala Pro Gly Glu Pro Thr Ser Arg Ser Gin 
85 90 95 

Gly Glu Asp Cys Val Met Met Arg Gly Ser Gly Arg Trp Asn Asp Ala 
100 105 110 

Phe Cys Asp Arg Lys Leu Gly Ala Trp Val Cys Asp Arg 



115 



120 



125 



(2) INFORMATION FOR SEQIDNO: 6: 



(i) SEQUENCE CHARACTERISTICS : 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 



129 amino acids 
amino acid 
linear 



(ii) MOLECULE TYPE: 



protein 
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(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQIDNO: 6: 



Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Glu Arg Ser Cys Tyr 
15 10 15 

Trp Phe Ser Arg Ser Gly Lys Ala Trp Ala Asp Ala Asp Asn Tyr Cys 
20 25 30 

Arg Leu Glu Asp Ala His Leu Val Val Val Thr Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe Val Gin His His He Gly Pro Val Asn Thr Trp Met Gly Leu 
50 55 60 

His Asp Gin Asn Gly Pro Trp Lys Trp Val Asp Gly Thr Asp Tyr Glu 
65 70 75 80 

Thr Gly Phe Lys Asn Trp Arg Pro Glu Gin Pro Asp Asp Trp Tyr Gly 
85 90 95 

His Gly Leu Gly Gly Gly Glu Asp Cys Ala His Phe Thr Asp Asp Gly 
100 105 110 

Arg Trp Asn Asp Asp Val Cys Gin Arg Pro Tyr Arg Trp Val Cys Glu 
115 120 125 

Thr 



(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 129 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 
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(xi) SEQUENCE DESCRIPTION: SEQIDNO: 7: 



Arg Thr Cys Cys Pro Val Asn Trp Val Glu His Gin Gly Ser Cys Tyr 
1 5 10 15 

Trp Phe Ser His Ser Gly Lys Ala Trp Ala Glu Ala Glu Lys Tyr Cys 
20 25 30 

Gin Leu Glu Asn Ala His Leu Val Val lie Asn Ser Trp Glu Glu Gin 
35 40 45 

Lys Phe lie Val Gin His Thr Asn Pro Phe Asn Thr Trp lie Gly Leu 
50 55 60 

Thr Asp Ser Asp Gly Ser Trp Lys Trp Val Asp Gly Thr Asp Tyr Arg 
65 ~ 70 75 80 

His Asn Tyr Lys Asn Trp Ala Val Thr Gin Pro Asp Asn Trp His Gly 
85 90 95 

His Glu Leu Gly Gly Ser Glu Asp Cys Val Glu Val Gin Pro Asp Gly 
100 105 110 

Arg Trp Asn Asp Asp Phe Cys Leu Gin Val Tyr Arg Trp Val Cys Glu 
115 120 125 

Lys 



(2) 



INFORMATION FOR SEQ ID NO: 8: 



SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 

(B) TYPE: 

(D) TOPOLOGY: 



130 amino adds 
amino acid 
linear 



(ii) 



MOLECULE TYPE: 



protein 



(v) 



FRAGMENT TYPE: 



internal 



(xi) 



SEQUENCE DESCRIPTION: 



SEQIDNO: 8: 



Leu Gin Leu lie Met Gin Asp Trp Lys Tyr Phe Asn Gly Lys Phe Tyr 
15 10 15 
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Tyr Phe Ser Arg Asp Lys Lys Ser Trp His Glu Ala Glu Asn Phe Cys 
20 25 30 

Val Ser Gin Gly Ala His Leu Ala Ser Val Thr Ser Gin Glu Glu Gin 
35 40 45 

Ala Phe Leu Val Gin He Thr Asn Ala Val Asp His Trp He Gly Leu 
50 55 60 

Thr Asp Gin Gly Thr Glu Gly Asn Trp Arg Trp Val Asp Gly Thr Pro 
65 ' 70 75 80 

Phe Asp Tyr Val Gin Ser Arg Arg Phe Trp Arg Lys Gly Gin Pro Asp 
85 90 95 

Asn Trp Arg His Gly Asn Gly Glu Arg Glu Asp Cys Val His Leu Gin 
100 105 HO 

Arg Met Trp Asn Asp Met Ala Cys Gly Thr Ala Tyr Asn Trp Val Cys 
115 120 125 



Lys Lys 
130 



(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 130 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 
(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9: 

Pro Thr His Cys Pro Ser Gin Trp Trp Pro Tyr Ala Gly His Cys Tyr 
15 10 15 

Lys He His Arg Asp Glu Lys Lys He Gin Arg Asp Ala Leu Thr Thr 
20 25 30 

Cys Arg Lys Glu Gly Gly Asp Leu Thr Ser He His Thr He Glu Glu 
35 40 45 



SUBSTITUTE SHEET 



PCI7US92/05985 

WO 93/01820 _ 

- 28 - 



Leu Asp 

• SO 



Phe lie lie Sar Gin Leu Gly Leu Glu Pro Asn Asp Glu Leu 



55 60 



Trp lie Gly Leu Asn Asp He Lys II. Gin Met Tyr Phe Glu Trp Ser 



65 



70 75- 



Asp Gly Thr Pro Val Thr Phe Thr Lys Trp Leu Arg Gly Glu Pro Ser 



85 



90 



His Glu Asn Asn Arg Gin Glu Asp Cys Val Val Met Lys Gly Lys Asp 
100 105 

Gly Tyr Trp Ala Asp Arg Gly Cys Glu Trp Pro Leu Gly Tyr lie Cys 



115 



120 



Lys Met 
130 
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We claim: 



2. 

3. 

4. 

5. 

6. 

7. 

8. 

10. 

10. 

11. 

12. 
13. 
14. 



A method of inhibiting HIV infection of mammalian cells comprising contacting the 
cells with an effective amount of a compound selected from the group consisting of 
a mannose carbohydrate, a fucose carbohydrate, a lectin and a drug, for a time 
period sufficient to significantly inhibit the binding of HIV to a non-CD4 cell 
surface protein. 

The method of Claim 1, wherein the non-CD4 cell surface protein is a gpl20 
receptor having a specific binding affinity for gpl20 of about Kd = 1.3 nM to 
about Kd = 2.0 nM. 

The method of Claim 2, wherein the gpl20 receptor is present on placental cells. 
The method of Claim 2, wherein the gpl20 receptor is present on muscle cells. 
The method of Claim 2, wherein the gpl20 receptor is present on neural cells. 
The method of Claim 5, wherein the neural cells are brain cells. 
The method of Claim 5, wherein the neural cells are dendritic cells. 
The method of Claim 2, wherein the gpl20 receptor is present on mucosal cells. 
The method of Claim 1, wherein the compound is mannose. 
The method of Claim 1, wherein the compound is fucose. 

The method of Claim 1, wherein the compound is a mannose-containing 
carbohydrate. 

The method of Claim 11, where the carbohydrate is mannan. 

The method of Claim 1, wherein the compound is a pradimicin A antibiotic. 

A substantially purified non-CD4 gpl20 receptor protein comprising a protein 
substantially corresponding to a non-CD4 mammalian cell surface protein that has a 
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specific binding affinity for gpl20, said protein containing about 400 amino acid 
residues, having a molecular weight of about 45,000 daltons and having a binding 
affinity for gpl20 characterized by a Kd of about 1.3 nM to about 2 nM. 

15. The gpl20 receptor protein of Claim 14, wherein the binding of the gpl20 receptor 
protein to gpl20 is inhibited by a compound selected from the group consisting of a 
mannose carbohydrate, a fucose carbohydrate, a lectin and a drug. 

16. The gpl20 receptor of Claim 15, wherein the compound is mannose. 

17. The gpl20 receptor protein of Claim 15, wherein the compound is a pradimicin A 
antibiotic. 

18. The gpl20 receptor protein of Claim 14, wherein the protein is produced by 
recombinant means. 

19. The gpl20 receptor protein of Claim 18, wherein said recombinant means 
comprises the cloning of a cDNA isolated from a library of recombinant placental 
genes. 

20. A DNA molecule encoding the gpl20 receptor protein of Claim 14, wherein the 
DNA is a complementary DNA that transcribes an mRNA found in cells selected 
from the group consisting of placental cells, brain cells, muscle cells and colon 
cells. 

21. A method of detecting the presence of HIV in a sample comprising: 

(a) admixing in an aqueous medium a sample to be assayed with a non- 
CD4 gpl20 receptor protein having a specific binding affinity for gpl20 
characterized by a Kd of about 1.3 nM to about 2.0 nM in an amount 
sufficient to carry out at least one assay; 

(b) maintaining the admixture for a time period sufficient for the gpl20 
receptor protein to bind to any HIV present in the sample and form a 
reaction product; and 

(c) determining the presence of the HIV containing reaction product 
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The method of Claim 21, wherein the gpl20 receptor protein contains about 
400 amino acid residues and has a molecular weight of about 45,000 
daltons. 

The method of Claim 21, wherein the gpl20 receptor protein is affixed to a 
solid matrix to form a solid support. 

The method of Claim 21, wherein the presence of the reaction product is 
determined by contacting the sample with a reagent capable of detecting the 
bound gpl20 receptor protein. 

The method of Claim 24, wherein the reagent is a labelled antibody directed 
against the gpl20 receptor protein. 

, 26. A diagnostic system in kit form, for assaying for the presence of HIV in a 

* fluid sample, comprising a package containing a non-CD4 receptor protein 
k having a specific affinity for gpl20 characterized by a Kd of about 1.3 nM 
,^ to about 2.0 nM, and instructions for use. 

* ■ 

' 27. The diagnostic system of Claim 26, wherein the non-CD4 gpl20 receptor 

* protein is affixed to a solid matrix to form a solid support. 



22. 



23. 
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Figure 1A 
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Figure 1C 
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Figure 2A 
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Figure 2B 
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Figure 2C 
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Figure 2D 
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Figure 2E 
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42 
1 



102 
21 



CTAAAGCAGGAGTTCTGGACACTGGGGGAGAGTGGGGTGAC 

ATGAGTGACTCCAAGGAACCAAGACTGCAGCAGCTGGGCCTCCTGGAGGAGGAACAGCTG 
MSD SK EPRLQQLGLLEE EQ L 

AGAGGCCTTGGATTCCGACAGACTCGAGGATACAAGAGCI^AGCAGGGTGTCTTGGCCAT 
RGLGFRQTRGYKSLAGCLGH 



162 GGTCCCCTGGTGCTGCAACTCCTCTCCTTCA 
41 CPLVLOLLSFTLLAGT.IiV QV 

222 TCCAAGGTCCCCAGCTCCATAAGTCAGGAACAATCCAGGCAAGACGCGATCTACCAGAAC 
61 SKVPSSISQEQSRQDAIYQN 

Rl * 

282 CTGACCCAGCTTAAAGCTGCAGTGGGTGAGCTCTCAGAGAAATCCAAGCTGCAGGAGATC 
81 LTQLKAAVGELSEKSKLQEI 

R2 

342 TACCAGGAGCTGACCCAGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCTG 
101 YQELTQLKAAVGELPEKSKL 

402 CAGGAGATCTACCAGGAGCTGACCCGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAA 
121 QEIYQELTRLKAAVGELPEK 

R3 

462 TCTAAGCTGCAGGAGATCTACCAGGAGCTGACCTGGCTGAAGGCTGCAGTGGGTGAGCTT 
141 SKLQEIYQELTWLKAAVGEL 

R4 

522 CCAGAGAAATCTAAGATGCAGGAGATCTACCAGGAGCTGACTCGGCTGAAGGCTGCAGTG 
161 PEKSKMQEIYQELTRLKAAV 

R5 

582 GGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACCCGGCTGAAG 
181 GELPEKSKQQEIYQELTRLK 

R6 

642 GCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAGGAGCTGACC 
201 AAVGELPEKSKQQEIYQELT 

R7 

702 CGGCTGAAGGCTGCAGTGGGTGAGCTTCCAGAGAAATCTAAGCAGCAGGAGATCTACCAG 
221 RLKAAVGELPEKSKQQEIYQ 

R8 

762 GAGCTGACCCAGCTGAAGGCTGCAGTGGAACGCCTGTGCCACCCCTGTCCCTGGG AATGG 
241 ELTQ LKAAVERLCH P CPW EW 

L 

822 ACATTCTTCCAAGGAAACTGTTACTTCATGTCTAACTCCCAGCGGAACTGGCACGACTCC 
261 TFFQGNCYFMSNSQRNWHDS 



Figure 3A 
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882 
281 

942 
301 

1002 
321 

1062 
341 

1122 
361 

1182 
381 

1242 
401 

1302 



ATCACCGCCTGCAAAGAAGTGGGGGCCCAGCTCGTCGTAATCAAAAGTGCTGAGGAGCAG 
ITACKEVGAQLVVIKSAEEQ 

AACTTCCTACAGCTGCAGTCTTCCAGAAGTAACCGCTTCACCTGGATGGGACTTTCAGAT 
NFLQLQSSRSNRFTWMGLSD 

CTAAATCAGGAAGGCACGTGGCAATGGGTGGACGGCTCACCTCTGTTGCCCAGCTTCAAG 
L NQEGTWQWVDGSPLLP S FK 

CAGTATTGGAACAGAGGAGAGCCCAACAACGTTGGGGAGGAAGACTGCGCGGAATTTAGT 
QYWNRGEPNNVGEEDCAEFS 

GGCAATGGCTGGAACGACGACAAATGTAATCTTGCCAAATTCTGGATCTGCAAAAAGTCC 
GNGWNDDKCNLAKFWI CKKS 

GCAGCCTCCTGCTCCAGGGATGAAGAACAGTTTCTTTCTCCAGCCCCTGCCACCCCAAAC 
AASCSRDEEQFLSPAPATPN 

CCCCCTCCTGCGTAGCAGAACTTCACCCCCTTTTAAGCTACAGTTCCTTCTCTCCATCCT 
P P P A *** 

TCGACCTTTAG 



Figure 3A(cont.) 
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Figure 3B 
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Figure 3C 

SUBSTITUTE SHEET 



J It ^ 



